Extracting Web User Profiles Using Relational Competitive Fuzzy Clustering

Authors

  • Olfa Nasraoui
  • Hichem Frigui
  • Raghu Krishnapuram
  • Anupam Joshi
Abstract

The proliferation of information on the World Wide Web has made the personalization of this information space a necessity. An important component of Web personalization is to mine typical user profiles from the vast amount of historical data stored in access logs. In the absence of any a priori knowledge, unsupervised classification or clustering methods seem to be ideally suited to analyze the semi-structured log data of user accesses. In this paper, we define the notion of a "user session" as being a temporally compact sequence of Web accesses by a user. We also define a new distance measure between two Web sessions that captures the organization of a Web site. The Competitive Agglomeration clustering algorithm, which can automatically cluster data into the optimal number of components, is extended so that it can work on relational data. The resulting Competitive Agglomeration for Relational Data (CARD) algorithm can deal with complex, non-Euclidean distance/similarity measures. This algorithm was used to analyze Web server access logs successfully and obtain typical session profiles of users.
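The notion of a user session as a temporally compact sequence of accesses can be illustrated with a minimal sketch. The abstract does not say how temporal compactness is decided, so the gap threshold `max_gap_seconds`, the `(user, timestamp, url)` record layout, and the function name `split_sessions` are all assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def split_sessions(accesses, max_gap_seconds=1800):
    """Group (user, timestamp, url) records into per-user sessions.

    A new session starts whenever the gap between two consecutive accesses
    by the same user exceeds max_gap_seconds. The threshold is an assumed
    value for illustration; it does not come from the paper.
    """
    # Collect each user's accesses so they can be ordered in time.
    by_user = defaultdict(list)
    for user, timestamp, url in accesses:
        by_user[user].append((timestamp, url))

    sessions = []
    for user, hits in by_user.items():
        hits.sort()  # order by timestamp
        current = [hits[0][1]]
        last_time = hits[0][0]
        for timestamp, url in hits[1:]:
            if timestamp - last_time > max_gap_seconds:
                # Gap too large: close the current session, start a new one.
                sessions.append((user, current))
                current = []
            current.append(url)
            last_time = timestamp
        sessions.append((user, current))
    return sessions
```

Each resulting session is a list of URLs; a pairwise distance between such sessions would then form the relational input to a clustering algorithm such as CARD.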


Similar resources

Mining Web Access Logs Using Relational Competitive Fuzzy Clustering

The proliferation of information on the World-Wide Web has made the personalization of this information space a necessity. An important part of Web personalization is to mine typical user profiles from the vast amount of historical data stored in access logs. In this paper, we define the notion of a “user session” and a new distance measure between two web sessions that captures the organizatio...


Web User Profiling Using Relational Fuzzy Clustering

User profiling is a fundamental task in Web personalization. In this paper, we use a relational fuzzy clustering to discover user profiles from Web log data. Precisely, a modified version of the CARD algorithm, called CARD+, is proposed to discover clusters embedded in the Web usage data and derive profiles modeling the real user preferences. Experimental results on log data extracted from log ...


Relational fuzzy approach for mining user profiles

Capturing the characteristics and preferences of Web users into user profiles is a fundamental task to perform in order to implement forms of personalization on a Web site. In this paper, we present a relational fuzzy clustering approach to extract significant user profiles from session data derived from log files. In particular, a modified version of the CARD clustering algorithm is proposed i...


Learning Web Users Profiles With Relational Clustering Algorithms

In the context of web personalization and dynamic content recommendation, it is crucial to learn typical user profiles. Although there exist several approaches to mine user profiles (such as association rules or sequential patterns extraction), this paper focuses on the application of relational clustering algorithms on web usage data to characterize user access profiles. These methods rely on...


Rough set based User profiling for Web Personalization

Web usage mining has recently emerged as a basis for extracting useful user access pattern information, such as user profiles, from enormous amounts of Web log data for web site personalization. A profile can consist of a set of URLs that are relevant to the sessions assigned to a given cluster. Once these profiles are discovered, they can be exploited as part of an automated personalization on...



Journal:
  • International Journal on Artificial Intelligence Tools

Volume 9  Issue 

Pages  -

Publication date 2000